# 4-bit Quantization
Josiefied Qwen3 30B A3B Abliterated V2 4bit
A 4-bit quantized conversion of a Qwen3-30B-A3B model, suitable for text generation with the MLX framework.
Large Language Model
mlx-community
194
1
Medgemma 27b Text It 4bit
Other
MedGemma-27B-Text-IT-4bit is an MLX-format model converted from Google's MedGemma-27B-Text-IT model, specifically optimized for medical and clinical reasoning tasks.
Large Language Model
mlx-community
193
3
Moondream 2b 2025 04 14 4bit
Apache-2.0
Moondream is a lightweight vision-language model designed for efficient cross-platform deployment. The 4-bit quantized version released on April 14, 2025 significantly reduces memory usage while maintaining high accuracy.
Image-to-Text
Safetensors
moondream
6,037
38
Turkishlaw
Apache-2.0
A Turkish-law language model based on Qwen3-14B, fine-tuned with LoRA and quantized to 4 bits.
Large Language Model Supports Multiple Languages
OrionCAF
19
3
QWEN 3B INSTRUC Medical COT SFT 2kstep 4kcol
Apache-2.0
A 3B-parameter instruction-tuned model based on the Qwen2.5 architecture, trained with Unsloth and the Hugging Face TRL library for faster training.
Large Language Model
Transformers English

hailong18102002
30
1
Qwen3 4B Rpg Roleplay
Apache-2.0
A roleplay dialogue model fine-tuned from Qwen3-4B that generates coherent dialogue consistent with character traits.
Large Language Model English
Chun121
1,657
6
Mistral 7B Instruct V0.3 Forensics V1
A fine-tuned version of Mistral-7B-Instruct-v0.3 designed for Q&A in forensic investigations, supporting forensic reasoning and rapid knowledge retrieval.
Large Language Model
Transformers

gerasmark
28
2
UI TARS 1.5 7B 4bit
Apache-2.0
UI-TARS-1.5-7B-4bit is a multimodal model focused on image-text-to-text conversion tasks, supporting the English language.
Image-to-Text
Transformers Supports Multiple Languages

mlx-community
184
1
VL Rethinker 72B 4bit
Apache-2.0
VL-Rethinker-72B-4bit is a multimodal model based on Qwen2.5-VL-72B-Instruct that supports visual question answering and has been converted to MLX format for efficient operation on Apple devices.
Image-to-Text
Transformers English

mlx-community
26
0
Gemma 3 27b It Qat 4bit
Other
Gemma 3 27B IT QAT 4bit is an MLX-format model converted from Google's original model, supporting image-to-text tasks.
Image-to-Text
Transformers Other

mlx-community
2,200
12
Space Voice Label Detect Beta
Apache-2.0
A fine-tuned version of the Qwen2.5-VL-3B model, trained about 2x faster with Unsloth and the Hugging Face TRL library.
Image-to-Text
Transformers English

devJy
38
1
Qwen2.5 Omni 7B GPTQ 4bit
MIT
A 4-bit GPTQ quantized version of the Qwen2.5-Omni-7B model, supporting multilingual and multimodal tasks.
Multimodal Fusion
Safetensors Supports Multiple Languages
FunAGI
3,957
51
Gemma 3 27b It Abliterated Mlx 4Bit
This is a 4-bit quantized version converted from the mlabonne/gemma-3-27b-it-abliterated model, optimized for the MLX framework.
Large Language Model
Transformers

sistabossen
119
0
Travelbot
Apache-2.0
A Llama model trained about 2x faster with Unsloth and the Hugging Face TRL library.
Large Language Model
Transformers English

kitty528
9,146
2
Llama 3.2 11B Vision Radiology Mini
Apache-2.0
A vision instruction-tuned model fine-tuned with Unsloth for multimodal tasks.
Image-to-Text
Transformers English

mervinpraison
39
2
Sales Conversations Unsloth Llama 3.1 8B Instruct
Apache-2.0
A 4-bit quantized model based on Meta-Llama-3.1-8B-Instruct, trained with the Unsloth and TRL libraries.
Large Language Model
Transformers English

vakodiya
22
1
Qwen2 Audio 7B Instruct 4bit
A 4-bit quantized version of Qwen2-Audio-7B-Instruct, an audio-text multimodal large language model derived from Alibaba Cloud's original Qwen model.
Audio-to-Text
Transformers

alicekyting
1,090
6
Llama3.1 8b Instruct Summarize Q4 K M
Apache-2.0
A 4-bit quantized version based on Meta-Llama-3.1-8B-Instruct, trained about 2x faster with the Unsloth and Hugging Face TRL libraries.
Large Language Model English
raaec
107
0
Qwen2 1.5B Summarize
Apache-2.0
A specialized summarization model fine-tuned for 2 epochs from Qwen2-1.5B-Instruct.
Text Generation
Transformers English

thepowerfuldeez
228
1
Omost Dolphin 2.9 Llama3 8b 4bits
An Omost instruction-tuned model based on Llama3-8B with Dolphin-2.9 training, quantized to the 4-bit NF4 format.
Large Language Model
Transformers

lllyasviel
106
6
Llama3 8B Medical
Apache-2.0
A 4-bit quantized version of the Llama-3-8B model, fine-tuned for medical Q&A.
Large Language Model
Transformers English

ruslanmv
132
11
Llama3 Toxic 8B Float16
Apache-2.0
A text generation model fine-tuned from unsloth/llama-3-8b-bnb-4bit, trained about 2x faster with the Unsloth and TRL libraries.
Large Language Model
Transformers English

theminji
19
4
Llama 3 70B Uncensored
Apache-2.0
This is a text generation model fine-tuned using Unsloth and TRL libraries on the Llama-3-70B model, achieving 2x faster training speed.
Large Language Model
Transformers English

Dogge
171
18
Cogvlm Grounding Generalist Hf Quant4
Apache-2.0
CogVLM is a powerful open-source vision-language model supporting tasks like object detection and visual question answering, featuring 4-bit precision quantization.
Image-to-Text
Transformers

Rodeszones
50
9
Tinyllama NSFW Chatbot
Apache-2.0
A fine-tuned language model based on the 4-bit quantized version of TinyLLaMA, efficiently trained using Unsloth and TRL libraries
Large Language Model
Transformers English

bilalRahib
612
7
Internlm Xcomposer2 7b 4bit
Other
InternLM-XComposer2 is a vision-language large model (VLLM) based on InternLM2, featuring advanced image-text understanding and creation capabilities.
Image-to-Text
Transformers

internlm
74
10
Internlm Xcomposer2 Vl 7b 4bit
Other
A vision-language large model based on InternLM2, with outstanding image-text understanding and creation capabilities
Image-to-Text
Transformers

internlm
1,635
27
Meditron 7B AWQ
Meditron 7B is a medical large language model developed by the EPFL LLM Team. It was produced by continued pretraining of Llama-2-7B and focuses on medical knowledge encoding and clinical decision support.
Large Language Model
Transformers English

TheBloke
38.22k
3
Llama 2 7b Mt French To English
MIT
A LoRA adapter fine-tuned on the Meta Llama 2 7B model, specifically designed for French-to-English text translation tasks.
Machine Translation Supports Multiple Languages
kaitchup
268
3
Pygmalion 6b 4bit 128g
Openrail
A 4-bit GPTQ quantized model based on Pygmalion-6B, suitable for dialogue generation tasks, supporting English text generation
Large Language Model
Transformers English

mayaeary
40
40
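
Many of the entries listed here are described as 4-bit quantized (MLX, GPTQ, NF4, AWQ). As a rough illustration of what that means, the sketch below quantizes one group of weights to 16 levels with a per-group scale and offset, packs two codes into each byte, and estimates weight storage at a given bit width. It is a simplified affine scheme written for illustration only; it is not the exact method of any listed model or library, and all function names are hypothetical.

```python
from typing import List, Tuple


def quantize_4bit(values: List[float]) -> Tuple[List[int], float, float]:
    """Affine-quantize one group of floats to integer codes in 0..15."""
    lo, hi = min(values), max(values)
    # 4 bits give 16 levels, i.e. 15 steps between lo and hi.
    # Fall back to 1.0 when the group is constant to avoid dividing by zero.
    scale = (hi - lo) / 15 or 1.0
    codes = [min(15, max(0, round((v - lo) / scale))) for v in values]
    return codes, scale, lo


def dequantize_4bit(codes: List[int], scale: float, lo: float) -> List[float]:
    """Map 4-bit codes back to approximate float values."""
    return [c * scale + lo for c in codes]


def pack_nibbles(codes: List[int]) -> bytes:
    """Store two 4-bit codes per byte (pads with a zero code if odd)."""
    padded = codes + [0] if len(codes) % 2 else codes
    return bytes((padded[i] << 4) | padded[i + 1] for i in range(0, len(padded), 2))


def approx_weight_gib(n_params: int, bits_per_weight: float) -> float:
    """Rough weight storage in GiB, ignoring per-group scale/offset overhead."""
    return n_params * bits_per_weight / 8 / 2**30
```

Under these assumptions, an 8-billion-parameter model needs roughly a quarter of the weight storage at 4 bits that it needs at 16 bits, and each restored weight differs from the original by at most half a quantization step. Real formats add some overhead for the per-group scales and offsets and may use non-uniform levels.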